QUANTITATIVE ERGODIC THEOREMS AND THEIR 
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^ Abstract. We present a survey of ergodic theorems for actions of algebraic 

(^— •) and arithmetic groups recently established by the authors, as well as some of 

^Sj their applications. Our approach is based on spectral methods employing the 

^ unitary representation theory of the groups involved. This allows the derivation 

C^ of ergodic theorems with a rate of convergence, an important phenomenon which 

■^^ does not arise in classical ergodic theory. Quantitative ergodic theorems give 

I -^ rise to new and previously inaccessible applications, and we demonstrate the 

psj remarkable diversity of such applications by presenting several number-theoretic 

results. These include, in particular, general uniform error estimates in lattice 
points counting problems, explicit estimates in sifting problems for almost-prime 
points on symmetric varieties, bounds for exponents of intrinsic Diophantine 
approximation, and results on fast distribution of dense orbits on homogeneous 
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1. Introduction 

The goals of the present survey are to describe some mean and pointwise ergodic 
theorems for actions of algebraic groups and their lattice subgroups, and to give 
an exposition of some of the diverse applications that such ergodic theorems have, 
particularly to counting, equidistribution and Diophantine approximation problems 
in number theory. The results described below have been recently developed by 
the authors, and constitute an expansion of the range of applications of spectral 
methods in ergodic theory, particularly in the context of dynamics on homogeneous 
space. The survey is however limited in scope in that it describes only the authors' 



Date: April 26, 2013. 



2 ALEXANDER GORODNIK AND AMOS NEVO 

specific spectral approach, and it does not discuss other approaches within the very 
large and rapidly expanding field of homogeneous dynamics and its application in 
number theory. 

In order to situate our approach in the proper context, we start by describing 
below some elements in the interesting story of the development of the mean er- 
godic theorem, beginning with its formulation by von-Neumann in 1932. Let us 
however note here very briefiy the main new points in our discussion. First, the 
ergodic theorems, which we establish, are developed using spectral methods and 
representation theory of non-amenable algebraic groups, subjects left out of most 
of traditional ergodic theory. Second, these ergodic theorems provide a rate of con- 
vergence to the ergodic limit, a remarkable and most useful phenomenon that does 
not arise in classical ergodic theory. Third, the rate of convergence allows new and 
previously inaccessible applications of ergodic theorems to be developed, using the 
fact that the actions of the groups we study are closely connected to a variety of 
very natural number-theoretic problems. 

To demonstrate the utility and diversity of such applications, let us briefiy de- 
scribe a selection of counting, equidistribution, and Diophantine approximation 
problems that may at first sight appear completely unrelated. The main rationale 
of our exposition in the present survey is the fact, which will be explained in detail 
below, that solutions to all of the following problems depend on applying a suitable 
ergodic theorem to the case under consideration. 

1. The uniform lattice point counting problem. Let F be a discrete subgroup 
of a locally compact group G such that the space G/T carries a finite G-invariant 
measure. The lattice point counting problem is to count the number of elements of 
F in an increasing family of bounded domains Bt C G, and ideally to establish an 
asymptotic formula 

with K < 1 as small as possible. Some questions that arise here are, for example, 
what is the asymptotics of the number of unimodular integral matrices A lying in 
the norm ball ||y4|| < t as t — )■ cxo, or in more general domains in the matrix space ? 
What is the asymptotics for the number of such matrices A satisfying a congruence 
condition A = Aq mod g? More generally, what is the asymptotics of the number 
of unimodular matrices with entries in the ring of integers of an algebraic number 
field, which are of bounded height? 

Counting lattice points is in fact the most basic of all mean ergodic theorems 
that we consider. This point of view that will be explained in full in §5 below, 
where we will describe a general solution to the lattice point counting problem. 

2. Almost prime points on algebraic varieties. As we shall see below, es- 
tablishing robustness and uniformity properties of the solution to the lattice point 
counting problem is crucial in several important applications, which depend on 
answering more refined counting questions. Of those, let us mention the problem 
of existence of almost prime integral points on homogeneous varieties, raised in 
|NS10] . Consider for example the set of symmetric integral positive-definite matri- 
ces of fixed determinant, and call such a matrix prime if all its entries are prime 
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numbers. How large is the set of prime points in the set of integral points? Less 
ambitiously, how large is the set of integral points all of whose entries are r-almost 
primes (i.e., products of at most r prime factors) ? 

This question has been the subject of intensive activity in recent years, and we 
refer to |BGS10I | and the recent surveys [ Lul2 ] and p^ for further discussion. In §6 
below we will utilise uniformity in the solution of the lattice point counting problem 
to establish an asymptotic lower bound of the right magnitude for the number of 
r-prime points lying on symmetric varieties. 

3. Diophantine approximation on algebraic varieties. Consider the ratio- 
nal ellipsoid {(x, y, z) ; ax^ + by"^ + cz^ = d}, a,b,c,d G N+. Assuming that the 
rational points on the ellipsoid are dense, is it possible to establish a rate of ap- 
proximation of a general points on the ellipsoid by rational points ? Is it possible to 
establish a rate of approximation by rational points satisfying additional integrality 
constraints ? 

These problems, in the much more general context of homogeneous algebraic 
varieties, were raised in S. Lang's 1965 Report on Diophantine Approximation 
|L65] . We will state this problem precisely in §7 and describe a general solution, 
obtained jointly with A. Ghosh, in the case where the variety is homogeneous under 
an action a simple algebraic group. For the rational ellipsoids we establish a rate 
of approximation by rational points satisfying integrality constrains that is best 
possible in a natural sense. 

4. Equidistribution of typical orbits on homogeneous spaces. Consider 
the one-sheeted hyperboloid {(x, y, z) ; ax^ + hy^ — cz^ = d}, a, b,c,d & N+, which 
is also known as de-Sitter space. The group of integral matrices preserving the 
corresponding quadratic form naturally acts on this space, and typical orbits are 
dense, but does a typical orbit have a limiting distribution in a suitable sense ? 

This problem, for general lattices in the isometry group of a Lorentz form acting 
on de-Sitter space of arbitrary dimension, was raised by V. Arnold in 1985 [Arj . A 
precise quantitative solution for general homogeneous varieties will be described in 
§9. 

5. Equidistribution in algebraic number fields. The group T = SL2{7j[y/d]) , 

where d > is not a square, is a dense subgroup oi G = SL2(M). Consider elements 
of r of bounded norm : Ft = {7 G F : log(||7|p -|- ||7|P) < t}, where 7 denotes the 
Galois conjugate. Do these sets become equidistributed in G in a suitable sense ? 
We note that the action of F on G by translations is isometric, and the natural 
problem is to prove equidistribution for every single orbit, i.e. every coset of F. 
One possible such statement is a ratio equidistribution theorem, establishing that 
for nice domains fli,fl2 C G, 

IFinf^il If^il 

— 7- ——- as t — )► 00 , 



iFiHfial 1^2, 

with the convergence taking place at a definite rate. Such a result will be stated 
precisely and explained in §9. 

We remark that the problem of establishing a ratio equidistribution theorem 
for dense subgroups acting by isometries on the group or its homogeneous spaces 
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was raised originally by Kazhdan |K67j , in the case of a dense group of Euclidean 
isometries acting on the Euclidean plane. We refer to |Vj for a recent progress on 
this problem. 

6. Ergodic theorems for lattice subgroups. Finally, let us mention some natu- 
ral questions which constitute direct generalizations of the fundamental convergence 
problems of classical ergodic theory. Taking the group F = SL„(Z) as an example, 
consider its linear action on the torus T" = ]R"/Z". Does the mean ergodic theorem 
hold for the action of F on T" ? Specifically, let F^ = {7 G F ; log ||7|| < t}, and for 
/ G LP(T") and x G T", consider the averages 

' ' 7ert 

Do these averages converge to the space average J^„ f{x) dx in L^? Does conver- 
gence hold almost everywhere ? If so, is there a rate of convergence to the space 
average, in L^ and almost everywhere ? 

These questions also arise for other interesting actions of SL„(Z) (for instance, 
for the action on the space of unimodular lattices or for the action on its profinite 
completion), and they are all resolved by a general pointwise and mean ergodic 
theorem for the averages on norm balls in SL„(Z), a result that will be formulated 
more generally and precisely in §8 and explained there. 

To conclude the introduction, let us emphasize that it is ultimately the existence 
of an explicit rate of convergence in the ergodic theorems that is responsible for 
solution of the above problems. We will therefore give in our discussion high priority 
to the problem of establishing an explicit rate of convergence of the averages we 
consider to the space average in measure-preserving dynamical systems. 

To situate our approach in its proper context, let us now turn to describe some 
elements in the development of the mean ergodic theorem. We will also mention 
some of the milestones in the applications of spectral methods in ergodic theory. 
As we shall see, an interesting feature of this story is that the success of the clas- 
sical methods of ergodic theory is inextricably tied to both the impossibility of 
establishing a rate of convergence in the ergodic theorems, and the impossibility of 
extending the classical methods to classes of groups such as simple algebraic groups 
or their lattice subgroups. 

2. Averaging in ergodic theory 

2.1. The classical mean ergodic theorem. The basic objects of study in clas- 
sical Hamiltonian mechanics consist of a compact Riemannian manifold M, which 
is called the phase space, together with a divergence-free vector field on M. We 
denote by /i the Riemannian volume on M, normalised to be a probability mea- 
sure. The integral curves of the vector field give rise to a one-parameter group of 
volume-preserving transformations ag '■ M ^ M, s G M, which describes the time 
evolution in phase space. For a function / : M — )■ M, the time averages along orbits 
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are defined by 



1 /•* 

Af{x) = - f{asx) ds. 

'(' Jo 



One of tfie most fundamental questions in classical dynamics is the elucidation of 
their long term behavior. Two important questions that arise are : 

• do the time averages Atf converge at all, and if so, what is their limit? 

• how does the limit depend on the initial state x and the function /? 

The solution proposed by von Neumann to this problem was to consider for every 
measure-preserving map a^ : M — )■ M an associated unitary transformation, the 
Koopman operator Us : L'^{M) — )■ L'^{M), given by Usf\x) = f^agx). Thus the 
measure-preserving flow gives rise to a one-parameter group of unitary transfor- 
mations. This made it possible for von-Neumann to apply his recently developed 
spectral theory to prove the mean ergodic theorem jvN32] : 

Theorem 2.1 (von Neumann). Let {X,fi) be a (standard) probability space with 
a one-parameter flow of measure-preserving transformations a^ : X — )■ X, s G M. 
Then for any f G L'^{X), 

Atf -^Vf as t -)■ oo, 

in L'^-norm, where V denotes the orthogonal projection on the space of functions 
invariant under the flow. 

In particular, consider the case where the space of invariant functions in L'^{M) 
consists of constant functions only, or equivalently, every invariant measurable set 
has measure zero or one. Flows satisfying this property are called ergodic. Then the 
orthogonal projection of / to the space of constant functions is given by the "space 
average" J^ f dfi. Thus, for ergodic flows the time averages converge in L^-norm 
to the space average: 



Atf -^ f dfi as t — 7- oo, 
Jx 

verifying the "ergodic hypothesis" . 

Several years later, F. Riesz's |R38] gave an elegant elementary proof of the mean 

ergodic theorem, which became very influential in the subsequent development of 

ergodic theory. We will indicate it here in full, in order to highlight its main idea. 

Let Us : Ti ^ Ti, s E M., he a. one-parameter group of unitary operators on a 

Hilbert space "H and V denote the projection on the space of vectors invariant 

under the group. We will show that the averaging operators At = ]: Jq Us ds satisfy 

Atf — )■ Vf for every / G "H by breaking "H up to two orthogonal complementary 

subspaces and arguing separately in each of them. First, we apply At to vectors / 

of the form / = Ut^h — h. Then 



1 



t+to 



t > ' 



t+to rt 

Ush ds — I 

to Jo 

U.hds, 



Atf = AtiUtoh ~h) = j[ I Ushds- I Ushds 
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and therefore 

It 

II tJ w - ^ II II 

Since / is orthogonal to every invariant vector, namely Vf = 0, the mean ergodic 
theorem holds for / as stated, hence also for the closed linear span of such /'s. 
Obviously, the mean ergodic theorem holds in the space of invariant vectors. The 
proof is therefore complete upon observing that the span of Utgh — h with to ^ I^ 
and h eH is dense in the orthogonal complement of the space of invariant vectors. 
Notice that the main ingredient of the above proof is the property of asymptotic 
invariance (under translation) of the intervals [0,t], namely the fact that the mea- 
sure of the symmetric difference of [0,t] and [0,t] + to divided by the measure of 
[0, t] converges to zero at t — )■ oo. This crucial property of intervals on the real line 
admits a straightforward generalisation. Let G be a locally compact second count- 
able group G equipped with a left-invariant Haar measure. Define a one-parameter 
family of sets Ft G G oi positive finite measure to be asymptotically invariant if it 
satisfies 

' ,^, ' ^0 as t ^ oo. (2.1) 

uniformly for g in compact sets of G. A group admitting such an asymptotically 
invariant family (also called a F0lner family) is called amenable. 

Given a measure-preserving action of the group G on a probability space (X, /i), 
define for g E G the unitary Koopman operators 

7rxig):L\X)^L\X):f^fig-'x). 

It is straightforward to adapt Riesz's argument and conclude that amenable groups 
satisfy the mean ergodic theorem. Namely, denoting by Pt the uniform probabil- 
ity measures supported on an asymptotically invariant (F0lner) family Ft and by 
TTxiPt) the corresponding averaging operators in L^(X), we conclude that 

T^x{.Pt)f = vr— I nx{g)fdg^Vf as t -^ cx), 

in L^-norm. 

2.2. Equidistribution and pointwise convergence. So far we have discussed 
the convergence of the averages Tix{Pt)f in -^^-norm. The question as to the con- 
vergence nx{f3t)f{x) for individual points a; G X is of fundamental importance as 
well. 

We begin by describing the best-case scenario, namely equidistribution, and its 
characterisation in amenable dynamics. Let X be a compact metric space on which 
G acts continuously, and let Ft G G he a family of F0lner sets that define the 
corresponding averaging operators TCxiPt)- An extreme possibility is that for every 
continuous function / and for every point x G X, the "time averages" 7ix{(3t)f{x) 
converge uniformly to a limit m{f), which is independent of x. In that case, the 
functional / i— )■ m{f) defines a a probability measure m on X, which is invariant 
under G. Furthermore m is then the unique G-invariant probability measure on X, 
and the system is called uniquely ergodic. Conversely, when there exists a unique 
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G-invariant probability measure on X, the "time averages" of / converge uniformly 
to the space average /^ / dm. 

In general the situation is far more complicated, and the limit of the averages 
T^ x{,Pt) f {x) depends very sensitively on the initial point x. Typically there exist 
two (or even uncountably many) mutually singular invariant probability measures 
Hi and /i2 on X, such that the following holds. The time averages vrx(/3t)/(a;) of 
a continuous function / converge on a set Xi of full /xi-measure to J^ fd^i, and 
on a set X2 of full /i2-measure to J^ fdfi2, with the sets Xi and X2 disjoint, and 
/^ fd^i 7^ /^ fdfi2- The question of pointwise convergence for one-parameter flows 
is addressed by Birkhoff's pointwise ergodic theorem |B32j : 

Theorem 2.2 (Birkhoff). Let {X,fi) be a (standard) probability space with a one- 
parameter flow of measure-preserving transformations a^ : X — )■ X, s G M. Then 
for any f G L^{X), the time averages ^ f^ f{asx)ds converge for ^-almost every 
X eX toVf{x). 

An elementary proof of Birkhoff's theorem which relies on asymptotic invariance 
of intervals as well as their order structure was given by |KW82j . So far, this proof 
has not been generalized to other groups, including I?. The most successful ap- 
proach to the proof of poinwise ergodic theorems relies on controlling the maximum 
deviation between the time averages and the space average throughout the entire 
sampling process. Thus we consider the following fundamental quantity, called the 
maximal function associated to the flow : 

M/ = sup|7rx(A)/-^/|. 

In order to establish the pointwise convergence, we seek to prove the following two 
properties of the flow: 

I. Existence of a dense set of functions / G L^{X) for which iTx{f3t)f{x) con- 
verges for almost every x G X. 
II. The maximum error in the measurement performed by the time averages is 
controlled by the strong maximal inequality in L^(X): 

||A^/||2< const -ll/ll^, 

or by the weak maximal inequality in L^{X): 

fi{{Mf > r}) < const ■ r~^ 



Once these properties are established, it follows by a classical approximation ar- 
gument known as the Banach principle that the time averages converge almost 
everywhere for every / G L^{X). 

The proof of the first property proceeds by using asymptotic invariance of a 
F0lner family, which implies that pointwise convergence holds for the family of 
functions 7Txig)f ~ f with / G L°°(X) and g E G. This is a straightforward 
adaptation of Riesz's argument described above. 

The proof of the maximal inequality utilizes two separate ideas : the transference 
principle and covering arguments. The transference principle asserts that the max- 
imal inequality for any family of operators iTxiPt) acting on a general space (X, /i) 
can be reduced to the maximal inequality for the convolution operators defined by 
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/3t on the group G. This principle was initiated by Wiener |W39j . and general- 
ized by Calderon jC53j and Coifman- Weiss |CW76j . Finally, the investigation of 
the convolution operators exploits geometric covering properties of the translates 
of sets in a F0lner family Ft. This approach originated with Wiener |W39j . who 
has generalised the Hardy-Littlewood maximal inequality from the real line to Eu- 
clidean space by introducing an elegant and influential covering argument based on 
the doubling property 

\B2t\ ^ const ■ \Bt\ 
for volumes of the Euclidean balls Bt. 

The pointwise ergodic theorem was subsequently established, in turn, for groups 
with polynomial volume growth, exponential solvable Lie groups, and connected 
amenable Lie groups. This was carried out by Templeman |T67] , Emerson-Greenleaf 
|EG74] . Coifman- Weiss |CW76j and others. It should be pointed out, however, that 
a F0lner sequence may fail to satisfy the pointwise ergodic theorem, even for the 
group Z. The pointwise ergodic theorem for general amenable groups requires a 
regularity assumption stronger than asymptotic invariance, namely temperedness, 
and was estabhshed by Lindenstrauss in |L01j . We refer to |N06j for a recent survey 
and to |AABj for a detailed account of the ergodic theory of amenable groups. 

3. From amenable to non-amenable groups 
The proofs of the ergodic theorems indicated in the previous section were founded 



upon the almost invariance property (2.1) of the F0lner families. How then can 



one proceed when the group is not amenable, so that there are no asymptotically 
invariant families of sets at all? 

Before describing our approach, let us note one more consequence of the existence 
of asymptotically invariant sequences of sets in the group. For any properly ergodic 
(i.e., not transitive) action of a countable amenable group G, one can show that 
the space X admits a non-trivial asymptotically invariant sequence of sets, namely, 
a sequences of measurable sets An C X, such that 

< liminf /i(y4„) < limsupyu(y4„) < 1 and fi{gAnAAn) — )■ 

n— ^oo n— >oo 

uniformly for g in compact subsets of G. It then follows easily that there exists an 
asymptotically invariant sequence of functions /„ G L^{X) with zero integral such 
that 

||/„|| = 1 and ||7rx(^)/„-/„||^0 (3.1) 

uniformly for g in compact subsets of G. In fact, the existence of an asymptotically 
invariant sequence in every property ergodic action characterises amenable groups 
(see jSSniJHTQ] ). 

We can therefore conclude that a countable group G is amenable if and only if 
in every properly ergodic action, the averaging operators satisfy 

M(3)\l?jx) =1 (3.2) 



for every probability measure j3 on G, where Lq{X) denotes the orthogonal comple- 
ment to the space of constant functions. In particular, it follows that for a properly 
ergodic action of a countable amenable group, no uniform rate of convergence in 
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the mean ergodic theorem can be estabhshed, namely the norm of the averaging 
operators does not decay at all. 

3.1. Spectral gap property. Our discussion so far has lead us to the following 
observation : when G is a countable non-amenable group, at least some actions X 
have the property that the operators vrx(/3) are strict contractions on Lq(X), namely 
||7rx(/3) \\i2 < 1, for all absolutely continuous generating probability measure (3 on G. 
We can therefore conclude that a non-amenable countable group G necessarily has 
actions which admit a spectral gap, in the sense of the following general definition. 

Definition 3.1. A measure-preserving ergodic action of a group G on a probability 
space {X, fi) has a spectral gap if one of the following two equivalent conditions 
holds: 

• There is no asymptotically invariant sequences of functions as in (3.1). 



There exists an absolutely continuous symmetric probability measure (3 
whose support generates G such that 

MP)\qix) <i- (3.3) 



We remark that it follows that for actions with spectral gap the estimate (|3.3) 



holds for every absolutely continuous probability measure (3 on G whose support 
generates G as a semigroup. 

Kazhdan |K67j has made the truly fundamental discovery that many groups 
satisfy an extremely general form of the spectral gap condition. Not only it is 
satisfied for all ergodic probability preserving actions of the group, an analogous 
property holds in fact for all unitary representations without invariant unit vectors. 

Definition 3.2. A group G has Kazhdan property (T) if one of the following equiv- 
alent conditions holds: 

• Every unitary representation tt of G which has no invariant unit vectors also 



has no sequences of asymptotically invariant unit vectors as in (3.1). 
• For some absolutely continuous symmetric probability measure /3 on G 
whose support generates G, 

sup||7r(/3)||<l, (3.4) 

7r 

where the supremum is taken over all unitary representations of G that have 
no invariant unit vectors. 

Examples of groups satisfying property (T) abound. For instance, the groups 
SL„(]R), SL„(C) and Sp„(]R) with n > 3 have this property. Among simple Lie 
groups, only the isometry groups of real hyperbohc spaces and the isometry groups 
of complex hyperbolic spaces do not have property (T) (the case of rank-one groups 
is dealt by |K69j ). Similarly, simple algebraic groups over non-Archemedian local 
fields all have property (T), unless their split rank is one. We refer to the monograph 
|BHV] for an extensive discussion of this property. 

The isometry groups of real and complex hyperbolic spaces (and algebraic group 
of split rank 1) do in fact have probability preserving actions admitting an asymp- 
totically invariant sequence of unit vectors, namely actions which do not have a 
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spectral gap. Nevertheless, many of the most important and natural actions of 
these groups do have a spectral gap, although it may vary from action to action, 
and it does not obey the remarkable uniform estimate provided by property T. The 
problem of establishing uniform spectral estimates for certain families of actions (or 
families of unitary representations) of these groups is a problem of foundational im- 
portance, which has been studied extensively. Let us now turn to describe some of 
the results and the challenges of establishing such spectral estimates uniformly, and 
then explain the utility of such estimates in proving ergodic theorems with rates of 
convergence for actions of semisimple algebraic groups. 

3.2. Selberg's property and property r. The phenomenon where a family of 
actions with increasing complexity satisfies the spectral gap property uniformly, 



namely the estimate in (3.3) is uniform, is termed property (r). 

The first instance of the property (r) was discovered by A. Selberg in |S65j for 
congruence subgroups of SL2(Z) C SL2(M). Let us consider a sequence of covers of 
finite area hyperbolic surfaces 

e^ -^ > Sn^ >S2^ Si = SL2(Z)\H2. 

We denote by A(S'„) the bottom of the spectrum (excluding the zero eigenvalue) of 

the Laplace operator A = y'^ ( -^ + J^ ) on S'„. One might expect that when the 

covers S'„ grow in a regular fashion, they "approximate" the hyperbolic space IP, 
and X{Sn) approaches A(EI^) = |. In particular, \{Sn) is expected to stay bounded 
away from zero. Here the assumption that covers grow in a regular fashion is 
essential, and A. Selberg constructed an example of a sequence of covers Sn such 
that X{Sn) — )• as n — )■ oo. On the other hand, for the surfaces Si = r(£)\EI^, 
where T{i) = {76 SL2(Z) : 7 = I modi} denotes the congruence subgroup of 
SL2(Z) of level i, he showed that 

X{Se) > ^. (3.5) 

Moreover, A. Selberg conjectured that for the congruence covers, 

XiSe) > A(e2) = i. (3.6) 



Although estima te ([3^ has been improved in [GJ781 ILRS951 |T96i, "KSOS] , the Sel- 
berg conjecture (3.6) is still open. We refer to the surveys [S95. S03J for a more 
detailed discussion. Nowadays this conjecture is understood as a special case of 
the generalised Ramanujan conjectures, discussed in |S05l IBBT3] . which are most 
conveniently stated in the representation-theoretic terms. These conjectures also 
include the original Ramanujan conjecture concerning the bounds on Fourier coef- 
ficients of holomorphic modular forms, which was solved by P. Deligne. 

More generally, let G C GLd(C) be a semisimple algebraic group defined over Q, 
r = G(Z), and let 

r(£) = {7er: 7 = 1 modi} 

denote the congruence subgroups. We denote by 

ndi) = Ll{G{R)/r{i)) 
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the space of square-integrable functions with zero integral on the space G(R)/r{i). 
We consider the family of unitary representation of G(]R) on 'Hg(^) defined by 

7r,((7)0(x) = c^{g-'x), e -Hcii). (3.7) 

The generalised Ramanujan conjectures seeks to describe the decomposition of the 
representations vr^ (and more general automorphic representations) into irreducible 
constituents. We refer to |S051IBB13] for a comprehensive survey of this subject, but 
what would be of crucial importance for our purposes is the uniform spectral gap 



property generalizing (3.5). In the representation-theoretic language, this property 
can be reformulated in terms of isolation of the trivial representation of G(]R) from 
the other irreducible representations appearing in vr^ for all £ > 1 or in terms of 
estimates on the integrability exponents of the representations vr^, which we proceed 
to define. 



If n is a family of unitary representations, we say that 11 has property (r) if (3.4) 
holds with TT G n. It is of crucial for many number-theoretic applications to know 
the uniform spectral gap property for the family of automorphic representations 



(3.7). This problem has a long history and is closely connected with works on 
Langlands functoriality conjectures. In particular, for other forms of the group SL2, 
property (r) follows from Gelbart-Jacquet correspondence |GJ78j . An important 
milestone was reached by M. Burger and P. Sarnak |BS91j who showed that it is 
sufficient to verify property (r) for certain proper subgroups of the ambient group. 
Combined with previous work, this established property (r) for all semisimple Q- 
simple simply connected algebraic groups except the unitary groups. Finally, the 
case of unitary groups was settled by L. Clozel in |C03] . 

In the case of semisimple Lie groups, it is convenient to measure the quality of 
spectral gap in terms of integrability exponents that were introduced by Cowling 
|C79 j. and Howe and Moore |HM79] . as follows. Let vr be a unitary representation 
of a locally compact group G on a Hilbert space "H. For v,w E Ti, we denote by 
Cv,w{g) = (7r(5')f,w) the corresponding matrix coefficient, which is a continuous 
bounded function on G. We define the integrability exponent of the representation 
IT by 

g(7r) = inf {g > : c^_^ G U{G) for all f , w in a dense subset of "H} . (3.8) 

It was shown in |BW80l IC79J that when G is a connected simple Lie group with 
property T, 

supg(7r) < 00, (3.9) 

where the supremum is taken over all unitary representations of G that have no 
nonzero invariant vectors. The value of the supremum has been estimated in [HT. 



IL95t ILZ96t IO02t IN03J . The estimate (3.9) can be considered as a quantitative 



version of property (T). We also note the important fact that the works on property 
(r) for automorphic representations also lead to explicit estimates on sup£>]^ ^(vr^). 

4. Spectral gaps and ergodic theorems 

4.1. Quantitative mean and pointwise theorems for G-actions. We now 

turn to discuss the applications of the spectral estimates described in the previous 
section in the proofs of ergodic theorems. Let (X, /i) be a (standard) probability 
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space with a measure-preserving action of a locally compact second countable G 
equipped with left-invariant Haar measure. For a family of measurable sets Bt of 
G with finite positive measure, we denote by Pt the uniform probability measures 
supported on Bt. We aim to prove ergodic theorems for the corresponding averaging 
operators vrx(/3t) defined by 



Mt3t)f{x) 



\Bt 



fi9-'x)dg, feL^iX). 



(4.1) 



Bt 



In the rest of this section, we discuss quantitative ergodic theorems for connected 
simple Lie groups G (for instance, G = SL„(M)). In order to prove the quantitative 
mean ergodic theorem for the averages TixiPt), we need to estimate the norms 
of the operators 7ix{/3t)\L^{x)^ and this is achieved using the method of "spectral 
transfer", originating in [N98J. The methods consists of two separate steps. First, 
it is possible to reduce the norm estimate of 7r(/3i) in a general representation vr, to a 
norm estimate for the convolution operator XciPt) for the regular representation Xq 
of G acting on L'^{G). This serves as an analogue in this context, of the transference 
principle for amenable groups. Second, it is then necessary to estimate the norm 
of the convolution operators on G. 

Put in quantitative form, the principle of spectral transfer is based on the fol- 
lowing two fundamental facts: 

• If the action of G on (X, fi) has spectral gap, then the representation 
ttx\l^{x) has finite integrability exponent, and it follows that sufficiently 
high tensor power representation {t!'x\l^{x))'^"' embeds as a subrepresenta- 
tion of the regular representation Ac of G. It then follows from Jensen's 
inequality that for even n, 

^x{mLiix)\\<\\xcmf"'- 

The convolution operator XciPt) in the regular representation obey a re- 
markable convolution inequality, known as the Kunze-Stein phenomenon: 
for i3 e U{G) with 1 < r < 2 and / G L^{G), 

W*f\\2<Cr\mr\\fh 

with uniform c^ > 0. This convolution inequality for SL2(M) is due to 
Kunze and Stein |KS60] and in general to Cowling |C78] . The Kunze-Stein 
phenomenon therefore implies that 



|Ag(A)II<c. 



XBt 



\Bt 



Cr\Bt\ 



-1+1/r 



This leads to an estimate for 



71"x(A)|l2(X) 

one extends this estimate for the action o 



and using an interpolation argument, 
G on LP(X), 1 < p < oo. Hence, we 



deduce the following quantitative mean ergodic theorem, originated in |N98] : 

Theorem 4.1. Given a measure-preserving action with a spectral gap of a connected 
simple Lie group G on a (standard) probability space {X, fi) , for 1 < p < oo there 
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exist Cp, 6p > such that for the averaging operators T^xiPt) os in {4-1), 



^xWt)f- fdfi <C,|i?r''' 11/11,, feL^iX). 

J X p 

Moreover, the parameters Cp,6p depend only on p and the integrability exponent of 
the representation TTxlL^tx)- 

It is a remarkable phenomenon that the rate of convergence stated in Theorem 



4.1 depends only on the measure of the sets Bt. Thus, L^-convergence with a rate in 
a general ergodic action requires no geometric, regularity or stability assumptions 
of the family of sets in question, so that the quantitative mean ergodic theorem is 
very robust. 

The pointwise ergodic theore, however, is considerably more delicate, and for its 
validity some stability and regularity assumptions on the family Bf are indispens- 
able. We turn now to discuss these issues in greater detail. 

Definition 4.2. An increasing family of bounded measurable subset Bf, t > 0, of 
G is called admissible if there exists c > such that for all t > to and e G (0, ^o), 

Oe ■ Bf Oe C. Bt+cei 

where O^ denotes the e-neighbourhood of identity with respect to a right invariant 
metric on G, and 

\Bt+e\<il + ce)\Bt\. 

For instance, if G is embedded as a closed subgroup of GL„(]R) and || ■ || is norm 
on M„(]R), one can show that a family 

Bt = {geG: log||^||<t} 

is admissible (see |EMS96l Appendix], |GN10l Ch. 7]). 

Under this mild regularity property, we also establish a quantitative pointwise 
ergodic theorem |MNS00l CTTO] : 

Theorem 4.3. Consider a measure-preserving action with a spectral gap of a con- 
nected simple Lie group G on a (standard) probability space, and let i^xiPt) be the 
averaging operators with respect to an admissible family of sets. Then there exist 
Cp, 6p > such that for every f G L^{X), p > 1, and fi-almost every x E X , 

TTxmfix)- [ fdfi <Cpif,x)\B,r'K 

Jx 

where \\Cpif,-%<C;\\ flip. 

One can also establish pointwise ergodic theorem for general actions without 
the spectral gap assumption, but in this case there is no estimate on the rate of 
convergence. We refer to |GN10j for a detailed account or ergodic theorem for 
semisimple groups. 

4.2. On spectral methods in ergodic theory. Before proceeding with the de- 
scription of the applications of the mean and pointwise quantitative ergodic the- 
orems stated above, let us make several very brief comments on some of the uses 
that spectral methods have found in ergodic theory. 
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1. Euclidean groups : singular averages. The methods of classical harmonic anal- 
ysis on M'^, including singular integrals, square function estimates, restriction the- 
orems, fractional integrals and analytic interpolation have been utilized by several 
authors to prove ground-breaking pointwise ergodic theorems. Pioneering contri- 
bution here were Stein's ergodic theorem for power of a self adjoint operator |St61] . 
and the celebrated spherical maximal theorem in M'^, proved by Stein- Wainger for 
d > 3 |SW78j and by Bourgain for d = 2 |Bo83j . The spherical maximal inequalities 
were used to prove a pointwise ergodic theorem for sphere averages, with the case 
d > 3 established by Jones |Jo93j . and the case d = 2 by Lacey |La95j . Ergodic 
theorems for sphere averages on the Heisenberg group were developed in |NT97j . 

For actions of the integers, the methods of discrete harmonic analysis and espe- 
cially estimates of exponential sums were developed to prove pointwise ergodic the- 
orem for sparse sets of integers, such as square and primes, in Bourgain's pioneering 
work on the subject |Bo89] . For actions of the integer lattices Z'^, ergodic theorems 
were developed for discrete spheres and other averages by Magyar Mag02| . We 



remark that all the ergodic theorems just mentioned depend on the transference 
principle for the groups involved, namely they reduce the proof of the requisite 
maximal inequalities to the case of the action by convolution. 

2. Free groups : radial averages. The problem of establishing ergodic theorems 
for general non-amenable groups was raised already half a century ago by Arnol'd 
and Krylov |AK63j . An important case that has figured prominently in the theory 
ever since is that of the free (non-Abelian) group. Arnold and Krylov proved an 
equidistribution theorem for radial averages on dense free subgroups of isometries 
of the unit sphere §^ via a spectral argument similar to Weyl's equidistribution 
theorem on the circle. Guivarc'h has established a mean ergodic theorem for radial 
averages on the free group, using von-Neumann's original approach via the spectral 
theorem |G69j . The pointwise ergodic theorem for general actions of the free group 
was proved in |N94j for L^-functions, and extended to function in L^, p > 1, in 
|NS94j . The distinctly non-amenable phenomenon of an ergodic theorem with a 
quantitative estimate on the rate of convergence was realized by the celebrated 
Lubotzky-Phillips-Sarnak construction |LPS86l ILPS87J of a dense free group of 
isometries of S^ which has an optimal spectral gap. 

3. Semisimple groups : ball and sphere averages. An important and influential 
general spectral result on ergodic actions of semisimple Lie group has been the 
Howe- Moore mixing theorem |HM79] . a consequence of the decay of matrix co- 
efficients in unitary representations of these groups. Tempelman (see |T92j ) has 
used the Howe-Moore theorem to prove mean ergodic theorems for averages on 
semisimple Lie group. 

Some of the methods of classical harmonic analysis alluded to above were com- 
bined with unitary representation theory of semisimple groups and used extensively 
in the development of pointwise ergodic theorems for radial averages on semisimple 
Lie groups and some of their lattices. Pointwise ergodic theorems for radial averages 
on the free groups were developed in |N94t INS94] . For sphere and ball averages 
on simple groups of real rank one pointwise ergodic theorems were developed in 
|N94bt IN97t INS97] , and for complex groups in |CN01] . Pointwise ergodic theorems 
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with a rate of convergence to the ergodic mean were developed for radial averages 
on general semisimple Lie groups in jMNSOO] and for more general averages in 
|N98]. 

5. Counting lattice points 

Let G be a locally compact second countable group, and F is a discrete subgroup 
with finite covolume. We would be interested in determining the asymptotic be- 
haviour of the number of elements of F contained in a family of increasing compact 
domains Bt in G. The problem of this type arises naturally in arithmetic geometry 
when X C C" is an affine algebraic variety, and one is interested in estimating 
the cardinality of the set of integral points x G X(Z) with bounded norm. The 
techniques discussed in this section can be applied in particular when G C GL„(C) 
is an algebraic group defined over Q, and F = G(Z) is its arithmetic subgroup, and 
more generally to symmetric varieties of G. 

The simplest instance of the lattice counting problem is to estimate the number 
of integral vectors contained in a compact domain B C M.'^. In this case, it is easy 
to see that 

iZ'^nBl = \B\+0{\dB\), 

and typically Wr — )■ as the volume \B\ — )■ cxd, which leads to an asymptotic 
formula. The analogous problem for more general groups presents new significant 
challenges. In particular, for groups with exponential volume growth, \B\ is compa- 
rable to |i9i?|, and the fundamental domain for F in G might be unbounded. Thus, 
in order to establish an asymptotic formula for |F ni?|, one is required to use more 
sophisticated analytic techniques. A number of methods have been developed to 
address this problem, which include, in particular: 

• spectral expansion of the automorphic kernel |D42l IH561 IS561 IP76[ IG83t 
ILPOHI IMW92[ IDR,S98[ IBMWQQI iBnMlT] , 

• decay of matrix coefficients jMn2l [B82l IEM98[ lM^I)7l[BOT2] . 

• symbolic coding and transfer operator techniques |L89i IP95] , 

• the theory of unipotent flows |EMS96] . 

We refer to |B02] for an extensive survey of the lattice counting problem. 

Our approach to the lattice point counting problem in the domains Bt is to study 
the averaging operators defined by Bt in the action of G on the homogeneous space 
X = G/T equipped with the invariant probability measure /x. Given a family of 
domains Bt in G, we consider 

7rx(A)/(x) = ^ / i\9-'x) dg, f e L'{X). 

We now turn to demonstrate that a (quantitative) mean ergodic theorem for these 
operators implies a (quantitative) solution of the lattice point counting problem for 
the famiy of sets Bt-, provided that it satisfies the regularity property of Definition 

Theorem 5.1. Let G he a locally compact second countable group, F a discrete 
lattice subgroup and assume that the family of sets Bt is admissible. 
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(i) Assume the averages TTc/riPt) satisfy the mean ergodic theorem,: 

'2/ 



vrx(A)/- / fdn 
G/r 



^0, feL'iX). 



Then 



\rn Bt 



\B, 



\G/T\ 



as t — )■ oo. 



(ii) Assume the averages 7rx(A) satisfy the quantitative mean ergodic theorem: 



^x(A)/ 



fdfi 



X 



< Co \B, 



2 ' 



feL\x,fi), 



(5.1) 



with C21O > 0. Then for all sufficiently large tj^ 



|rn5i| 



\Bt\ 



\G/T\ 



0{\Btt^ 



im(G) + l^P 



for every p > 0, where the implied constants depends only on the parameters 
in (5.1) and Definition 4-S, and the covolume ofT. 



Let us outline the proof of Theorem 5.1 We first observe that the quantity |rn-Bt| 
can be approximated by suitable averages on space X. Recall that we denote by 
O^ the e-neighbourhood of identity with respect to a right-invariant metric on G, 
which clearly satisfies 0~^ = O^. We normalise the invariant measure on G so that 
r has covolume one. Let Xe be the characteristic function of this neighbourhood 
normahsed to have integral one and fs{gT) = J^-yi^r Xeidl) ■ Then J^ f dp = 1. We 
claim that for h & O^, 

(5.2) 



feig-'hV) dg<\TnBt\< feig-'hV) dg. 

Indeed, observe that 

\Bt n h^o. 



I fs{g-'hr) dg = J2 [ Xeig-'h^) dg = Y, 
If 7 G Bt-ce, then by the admissibility property, 

hjOe C OeBt-ceOs C Bt, 

SO that 

^ \Btnh-f Pel ^Y^ 



10. 



ir n Bt- 



t—ce I 



10, 



Bt n h-jO, 

lOJ 



7ernBt-ce ' ' 7er 

On the other hand, if h'^Oe H Bt j^ ^, then by the admissibility property, 

7 G h-^BtOe C OeBtO, C Bt+ce- 

Hence, 

^\Btnh-fOe\ Y- iBtHh-fOe 

7er 
This proves (|5.2[). 



10, 



E 

-yernBt+ce 



10, 



< ir n B 



t+cs I 



* By dini(G) here we mean the upper local dimension, namely the infimum over d > such 
that |Oe| > e'' for all sufficiently small e > 0, where O^ denotes the e- neighbour hood of identity. 
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It follows from the mean ergodic theorem that for every 6 > 0, 

/i({/ir e G/T : \7rx{/3t)fe{hT) - 1\ > 6}) ^ 

as t — 7- oo. In particular, it follows for all sufficiently large t, there exists ht G Og 
such that 

\7ix{f3t)fe{htT) -1\<6. (5.3) 



Combining this estimate with (5.2), we deduce that 



|r n Bt\ < (1 + 6)\Bt+ce\ < (1 + S){1 + c^e)\Bt\ (5.4) 

and similarly, 

\TnBt\>{l-6){l-c'e)\Bt\ (5.5) 

for sufficiently large t. Since these estimates hold for arbitrary 6,6 > 0, this proves 



the first part of Theorem 5.1 



The proof of the quantitative estimate follows similar ideas, and is equally straight- 
forward. We assume that e is sufficiently small, and in particular the neighborhoods 

Oe7, 7 G r, are disjoint. Then since \Oe\ > e" with n > dim(G), we deduce that 

|2 ,^^ 1-1 



IX. 



e|l2 



I a 



<e' 



The validity of (5.1) now implies that 



-29 



fxiihT e G/T : \nxi(3t)feihT) - 1\ > 5}) < Ca'rV" \Bt 
= 2G2£^^\Bt['^ 1 we observe that the above measure is strictly less than 



Taking 5 

^{OsT) = \Oe\, SO that we can find ht G Oe satisfying (5.3) and deduce estimates 

(5.4)-(5.5) above. Finally, to optimise the estimate, we take e = |5t|~^/*^"+-^'' with 



sufficiently large t. This leads to the second part of the theorem. 



Let us note that Theorem 5.1 can be generalized significantly in two important 
respects. For full details we refer to |GN12aj . 

(1) The quantitative lattice point count extends to families of sets Bt much 
more general than admissible ones. It suffices that the families satisfy the 
following weaker regularity property: there exist c, a > such that for all 
t >to and e G (0,eo)! 



U uBtV 



U,V&Oe 



< (l + ce^) 



n 



uBfV 



which we call Holder well-roundedness. This extends the solution for exam- 
ple to families of sector averages, families associated naturally with sym- 
metric varieties, and many others. 
(2) The quantitative solution to the lattice point counting problem is uniform as 
the lattice varies in the space of lattice subgroups. All that is needed is that 
the averaging operators satisfy a uniform norm decay estimate in the space 
L^(G/r), and that G/T contain an injective copy of a fixed neighborhood of 
the identity in G, as F varies. In particular, uniformity holds over any set of 
finite index subgroups which satisfies property (r). Note that in the ergodic- 
theoretic approach we have taken, this feature of uniformity is completely 
obvious from the proof. 
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In particular, combining the results of this section with Theorem 4.1 and property 
(r) for congruence subgroups, we deduce: 

Corollary 5.2. Let G C GL„(C) be a simply connected simple algebraic group 
defined over Q and r{i) = {76 G(Z) : 7 = id modi} be the family of congruence 
subgroups. Then for any Holder well-rounded family of sets Bf in G(R), there exists 
6 > such that for sufficiently large t, 

where the implied constant is independent of L 

This estimate will play an indispensable role in the next section. 

6. Almost prime points on homogeneous algebraic varieties 

In the present section we will describe a solution to the problem of sifting the 
integral points on certain homogeneous algebraic varieties to obtain almost prime 
points, following |NS10[[O"N12c J. As will be explained in due course, the uniformity 
in the lattice point counting problem established in the previous section will play 
a crucial role. But let's begin with some remarks on the origins of this problem. 

6.1. Sifting along finite-index subgroups. The prime number theorem de- 
scribes the asymptotic distribution of the set P = {2, 3, 5, 7, 11, . . .} of prime num- 
bers: 

|{pGP: p<t}| ~ ast^oo. (6.1) 

More generally, let us consider an integral irreducible polynomial /. We assume 
that the leading coefficient of / is positive and gcd(/(Z)) = 1. Is it true that 
f{x) is prime for infinitely many integers x? The question is answered for linear 
polynomials by the Dirichlet theorem about primes in arithmetic progressions, but 
it turns out to be extremely difficult for polynomials of higher degrees. A natural 
approximation to this problem is the question whether for a fixed r > 1, the values 
f{x) belongs to the set P,, of almost primes infinitely often, where P^ denotes the 
set of integers having at most r prime factors. This problem has been extensively 
studied using sieve methods. For example, it has been shown (see |HRt Sec. 9.5]) 
that when r = deg(/) + 1, there is a lower estimate of correct order: 

\{x eZ: fix) G P^, \x\ < t}\ > const ■ -^. (6.2) 

logt 

Although it is widely believed that under the above necessary conditions on /, the 
same estimate should holds for the set of primes as well, this is currently out of 
reach. In particular, it is not known weither there are infinitely many primes of the 
form ra^ + 1 (see |I78j for the best result in this direction). 

A far-reaching conjectural generalisation of this problem was proposed by Bour- 
gain, Gamburd and Sarnak (see |S08] and |BGS10j ). and termed the saturation 
problem, as follows. Given 

• a Zariski dense subgroup F of GLrf(Z), 



ERGODIC THEOREMS AND NUMBER THEORY 19 

• a linear representation p : GL(;(C) — )■ GL„(C) defined over Q such that 

p(r) c GL„(z), 

• a vector f G Z", 

• a polynomial / G Z[xi, . . . , a;„], 

is it true that f{x), with x G p(r)f , is r-prime infinitely often, and more generally, 
is the set of such points x Zariski dense in the Zariski closure of p{T)v7 
One can also ask for a quantitative version of this question. Setting 

0{t) = {xeTv: \\x\\ < t}, 

is it true that 

\{xeO{t) : f{x) e¥r}\> const- —-^ as t ^ oo, (6.3) 



in analogy with (6.2) 



It was a profound idea of P. Sarnak that classical sieve methods can be applied in 
this noncommutative setting provided that one is able to establish an asymptotic 
estimate for the cardinalities of the sets 

0{t,i) = {xeTv: ||a;|| <t, /(x) =Omod^} (6.4) 

as t — i- oo. It is crucial for the sieve method to succeed that this estimate is 
sufficiently uniform over i. Letting 

r(£) = {7Gr: 7 = idmod£} (6.5) 

denote the congruence subgroup of level i, it is clear that f{jT{i)v) = /(7f ) modi 
for 7 G r/r(£). Thus in order to estimate |(9(t,£)|, it is sufficient to estimate 
cardinalities of the sets 

{x G -fT{i)v : ||x|| < t} for 7 G T/T{i) . (6.6) 

Let us now consider the important special case of lattice actions on principal 
homogeneous spaces, where O = Tv C Gv, F is a lattice in G, and the stability 



group of f in G is finite. Then estimating (6.6) amounts to nothing other than the 
lattice point counting problem, specifically in the intersection of norm balls with 
translates of the orbits of congruence subgroups of F. It is crucial for the sieve 
method to succeed that the solution to the lattice point counting problem in such 
orbits be uniform over the congruence subgroups. The uniform counting property 
holds in these sets when F is an arithmetic subgroup of a semisimple algebraic 
group G (for instance, when F = SL(i(Z)) |NS10j . Thus the required estimates 
for the sieve method is ultimately derived from the uniform spectral gap property. 



namely property (r) discussed in Section 3.1 Indeed, property (r) gives a uniform 



spectral estimate for the ball averages acting on the spaces L^(G(M)/F(£)), which 
ultimately implies a uniform estimate in the lattice point counting problem. 

Establishing a Zariski dense subset of almost prime points on homogeneous va- 
rieties requires the more general uniform lattice point counting results of Corollary 



5.2, Before stating such a result, let us illustrate how the estimates on \0{t, 



comes into play on a simple sieve based on the Mobius function. Recall that the 
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Mobius function //(£) is equal to (—1)^, if £ is a product of r distinct prime factors, 
and is 0, otherwise, and the Mobius inversion formula 

Let Pz be the product of all prime numbers up to z and S{t, z) denotes the cardi- 
nality of the set of x G 0{t) such that f{x) is coprime to Pz- Then 

S{t,z)= Y. \0{t)n{f = k}\ (6.7) 

fc:gcd(fc,P^) = l 

k \ti\gcd{k,P,) ) 

e:e\p^ k:e\k 

te\p. 
Now suppose that for some k,,6 > 0, we have an estimate 

\0{tJ)\=p{i)\0{t)\ + 0{nOit)t') (6.8) 

with some uniform k,6 > 0, where 

|{rt;mod£}n{/ = Omod^}| 
^^' ~ I {Ft; mod £} I ■ 

When z is sufficiently small compared with t, the above computation leads to 



favourable estimate for S{t,z). However, to derive (6.3), one needs to estimate 
S{t, z) when z = 1°" with some a > 0, and the above sieve with the Mobius weights 
is not sufficiently efficient for our purpose. Fortunately, starting with the works of 
V. Brunn in 1920's more efficient sieving weights have been developed. We refer 
to recent surveys [F06l IMoOSl [GilOl IMollOj and monographs [FIl El [HRl |TK] 
for a comprehensive discussion of sieve methods. These techniques allow to deduce 
a favourable estimate on S'(t, z) with z = t" with a > provided that one knows 



(6.8) and asymptotic estimates of the coefficients p{t}- The latter estimates can 
be deduced from the strong approximation property and results on the number of 
points on varieties over finite fields. Ultimately the main difficulty lies in establish- 



ing (6.8), and the crucial initial input is the uniformity in the lattice point counting 
which follows from property (r). 

More generally, it is possible to apply this method to the case of orbits on sym- 
metric varieties of G. In particular, let us mention the following result |GN12ct 
Example 1.11], which shows that quadratic surfaces contain many integral points 
with almost prime coordinates and implies that the set of almost prime points 
is not conta ined in any proper algebraic subset. This example is a partial case 
of Theorem 6.1 below. Let Q{x) = J2ii=i^ij^i^j be a nondegenerate indefinite 



integral quadratic form in d variables. We assume that d > 4 (see |LS10j for 
d = 3 case) and denote by Spin^ the spinor group of Q. We fix v G Z"^ such that 
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n = Q{v) 7^ and consider the orbit O = SpinQ(Z)f which hes in the quadratic 
surface X = {Q{x) = n}. Assuming that the polynomial function / is absolutely ir- 
reducible on X and gcd(/(a;) : x G (9) = 1, there exists explicit r = r{d, deg(/)) > 1 



such that (6.3) holds. 



Turning to sifting on general symmetric varieties, recall that Corollary |5^ estab- 
lishes uniform asymptotics for the number of elements of r{i) contained in a family 
of Holder well-rounded subsets Bt of G(M). More generally, as noted already in 
the discussion of almost prime points above, one can also deal of cosets of r(£) and 
establish the following estimate. For every 7 G r/r(^), 

where 5 > depends only on the supremum of the integrability exponents in the 
representations LQ{G/r{i)), the dimension of G, and the Holder exponent of -Bj's, 



and the implicit constant is independent of i and 7 G T/T(i). Estimate (6.9) can 
be used directly for sieving for almost primes on the arithmetic groups F itself, or 
more generally in its orbits in principal homogeneous spaces, and this was carried 
out in |NS10j . To sieve for almost primes in orbits O = Tv, one may attempt to 
construct a Holder well-rounded family {Bt} of sets in G(]R) such that the map 

rnBt^ 0{t) : 7 ^^ 7t; 



is onto and has fibers with uniformly bounded cardinalities. Then using (6.9), one 
would show that there exists r > 1 such that 

If n R I 

|{7 G F n 5j : f{-fv) G Pr}| > const ■ — as t ^ 00, 



which implies (6.3). This strategy has been realised in |GN12cj when the stabiliser 
of f in G is a symmetric subgroup, namely the set of fixed points of an involution. It 
leads to the following result (see |GN12cl Theorem 1.10]) on the saturation problem 
for symmetric varieties. 

Theorem 6.1. Let G C GL„(C) be a connected Q-simple simply connected al- 
gebraic group, F = G(Z), and f G Z"' 6e such that StabG(f) is connected and 
symmetric. Then given any polynomial / : Z" — )■ Z such the function g 1— )■ f{gv) 
is absolutely irreducible on G and gcd(/(x) : x E O) = 1, there exists explicit 
r = r(G,f , deg(/)) > 1 for which the estimate |g.5| ) holds. 

6.2. Bounds on integral point satisfying congruence conditions. Another 
fundamental question regarding the distribution of prime numbers is the Linnik 
problem. By Dirichlet's theorem, for every coprime 6, £ G N one can find a prime 
number p satisfying 

p = b modi. 

Y. Linnik |L44at IL44bj raised the question whether this congruence can be solved 
effectively, and showed that there exist io,a > such that one can find a prime 
p satisfying p < i" for all i > Iq. This problem admits a sweeping generalization 
to the noncommutative set-up, namely estimating the norm of a minimal prime 
solution to a system of polynomial equations satisfying a congruence condition. 
The latter problem is out of reach at this time, but it is possible to give a general 



22 ALEXANDER GORODNIK AND AMOS NEVO 

solution to the problem of bounding the norm of an almost prime solution, as 
follows. 

Let O G Tj'^ he an orbit of an arithmetic group and / : (9 — ?■ Z a polynomial 
function. We would like to find a small solution of the congruence 

f{x) = bmodi, xeO, (6.10) 

with f{x) being almost prime. Using the standard sieving techniques combined 



with the estimate (6.9), we prove the following (see |GN12cl Theorem 1.13]): 



Theorem 6.2. Let G C GL„(C) be a connected Q-simple simply connected alge- 
braic group, r = G(Z), and f G Z". Then given a polynomial / : Z" — t- Z such the 
function g h- >■ f{gv) is absolutely irreducible on G and gcd{f{x) : x E O) = 1, there 
exist explicit r,io,a > such that for every coprime b,i E N satisfying i > io and 
b E f{0) modi, one can find x with \\x\\ < i"' satisfying (6.1(^ such that /(a 
r-prime. 



[X] IS 



The subject of sieving along orbits of noncomutative groups is a rapidly devel- 
oping area now. In particular, we mentionthe extensive developments regarding 
Apollonian packings |S07t IS091 IKUllt IBFllj and orbits of "thin" subgroups more 
generally JBGS061 [K09l IBGSIOI iBKTOl [KUT2] . We refer to [H O] for recent com- 
prehensive surveys. 

7. Intrinsic Diophantine approximation on homogeneous varieties 

7.1. Diophantine approximation and ergodic theory. In the theory of Dio- 
phantine approximation one is interested in finding an efficient rational approxima- 
tion for vectors x G W^. More explicitly, one would like to know for what range of 
parameters e and i?, the system of inequalities 

||x — r|| < e and D(r) < R 

has a solution r G Q'^, where || ■ || denotes the maximum norm, and D(r) denotes 
the denominator of a rational number r written in reduced form. In this section we 
consider a more general question regarding intrinsic Diophantine approximation on 
algebraic sets, namely approximation by rational points on the variety itself. Let 

X = {/i(x) = --- = /fc(x) = 0}cC" 

be an algebraic set defined by a family of polynomial with rational coefficients. For 
a real points x in X(]R) which is contained in the closure of the set X(Q) of rational 
points, we would like to produce an estimate that quantifies density. This amounts 
to solving the system of inequalities 

||x-r||<e and T){r) < t"'^ (7.1) 

with r G X(Q). The problem of Diophantine approximation on general algebraic 
varieties by rational points on it was raised in the 1965 survey by Lang |L65j . This 
question cannot be addressed in full generality because it is not even known how 
to determine whether X(Q) is finite or infinite. Here we propose an approach to 
it when X(Q) has an additional structure, namely when X(Q) is equipped with a 
group action. We note that when X is an elliptic curve (or, more generally, an 
abelian variety) the problem of Diophantine approximation on X has been studied 
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by M. Waldschmidt in |W99j . We have considered the problem of Diophantine ap- 
proximation on an algebraic set X equipped with a transitive action of a semisimple 
algebraic group G defined over Q. A key new feature of our approach is that it 
allows approximation by rational points satisfying an arbitrary set of integrality 
constraints, and not only by the set of all rational points. To simplify our nota- 
tion for this exposition, we consider the problem of Diophantine approximation by 
the rational points in X(Z[l/p]) where p is prime. The problem of Diophantine 
approximation by points in X(Z[l/pi, . . . , 1/ps]) (say), or by X(Q) can be han- 
dled similarly, but uses analysis on adelic spaces which would require much more 
elaborate notation. 

Let us illustrate our general result stated below by the following example, men- 
tioned in the introduction. Let X be a rational two-dimensional ellipsoid such that 
X(Z[l/]9]) is not discrete in X(M). Then we prove that 

• for almost every x G X(]R), every n > 2, and e G (0,eo(a;, /t)), the system of 
inequalities 

||x-r||<e and D{r) < e~'' (7.2) 

has a solution r G X(Z[l/p]). 

• for every x G X(]R), every k > 4, and e G (0, eo(/t)), the system of inequalities 

||a;-r||<e and D{r) < e'" (7.3) 

has a solution r G X{Z[l/p]). 



We note that the estimate (7.2) is essentially the best possible. Indeed, the number 



of r G X(Z[l/j9]) with D(r) < R grows as const • i? as i? — ;■ oo and dim(X) = 2, so 



that one can deduce from the pigeon-hole principle that k > 2 in (7.2). 
For a general algebraic set X, we set 

log|{rGl]nX(Z[l/p]): B{r) < R}\ 
ap[X) = sup limsup — — — 

compact Q C X(K) i?-5-oo log(-fx) 

Then again it follows from the pigeon-hole that the exponent k = ^.L-^ in (|7.1|) is 



the best possible. 

We note that rational ellipsoids constitute the only example for which we are 
aware of previous results in the literature, as follows. Diophantine approximation 
by points in X(Z[-]) for X = S"^ and X = S^ can be deduced from the celebrated 
construction by Lubotzky-Phillips-Sarnak of a dense subgroup of S0(3, M) with en- 
tries in Z[-] possessing the optimal spectral gap jLPS86irLPS87j . A rate of uniform 

approximation by all rational points in the spheres S"^, d > 2 was established by 
Schmutz |Sch08] . using elementary methods based on rational parametrizations of 
the spheres, the rate being given by k = 21og2(c? -|- 1). Duke |Du03j has estab- 
lished equidistribution of the set of all rational points on S"^, from which a rate of 
uniform approximation can be derived, and the same method applies to spheres of 
any dimension. Finally, we mention the remarkable recent results of Kleinbock and 
Merrill jKM] , which give the best possible results for approximation by all rational 
points on spheres of any dimension, establishing complete analogs of Dirichlet's and 
Khinchin's theorems in this case. 
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Turning now to a general variety, in order to study the density of the set X(Z[l/p]) 
in X(]R), we construct a dynamical system which encodes it in some sense. This 
turns out possible because the set X(Z[l/p]) is parametrised by the orbits of the 
group G(Z[l/p]). It is crucial for the classical theory of Diophantine approximation 
that Z"^ forms a lattice M*^, and in order to have a similar framework in our setting 
we consider the topological group G(]R) x G(Qp) (where Qp denotes the field of 
p-adic number) equipped with an invariant measure. Then the group G(Z[l/p]) 
embeds diagonally in G(]R) x G(Qp) as a discrete subgroup with finite covolume. 
We consider the space 

Y = (G(R) X G(Qp))/G(Z[l/p]). 

The group G(Qp) acts naturally on this space preserving the finite measure. This 
is the dynamical system that plays a crucial role in our considerations. It turns out 
that from the dynamical systems point of view, the original problem of Diophantine 
approximation corresponds to the shrinking target property. Suppose that we have 
an increasing sequence of compact subsets Bn of G(Qp) and a shrinking sequence 
of neighbourhoods Os in Y . The shrinking target problem asks for what range 
of parameters n and e the orbits BnU with y & Y reach the neighbourhoods O^. 
While in the classical theory of dynamical systems, this question is asked for a 
one-parameter flow, here we are required to answer a similar question for orbits 
of a large groups G(Qp). The connection between Diophantine approximation in 
W^ and the shrinking target property of one-parameter flows have been previously 
exploited by S. G. Dani, D . Kleinbock, and G. Margulis [D85l lD86l [DSQllKQSal 
iKMbl IKM98[ IK991 IKM99J . These developments are surveyed in jKOTl lMn2l[KTn] . 
Also in the context of negatively curved manifolds, this connection was explored 
by S. Hersonsky and F. Pauhn [HPOli IHPn2a[ [HPok^ . 




Figure 1. Shrinking target property 



7.2. Diophantine approximation and the duality principle. To address the 
problem of Diophantine approximation in our setting we take 

Br, = {geG{%): ||(?||p<pn (7.4) 

and construct a shrinking family of neighbourhoods Oe in F. An important prop- 
erty of Oe is that it models the family of neighbourhoods ||a; — r|| < e in X(]R) and 
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in particular |0e| decays as const ■ g^i^C-^) when e — > 0^. We show that every point 
X e X(]R) is naturally associated a point y^ G F, and the system of inequalities 

||x — r|| < e, D(r) < p" 

has a solution r G X(Z[l/p]) provided that 

5-1 l/,.n 0,^0. (7.5) 

This leads to a useful connection (summarised in Table 1) between the problem 
of Diophantine approximation on X and the distribution of orbits of F. In the 
classical setting an analogous correspondence was observed by S. G. Dani in |D85j 
and called the Dani correspondence in |KM98j . Since our construction is somewhat 
similar, we call it the generalised Dani correspondence, see Table 1 below. 



Diophantine approximation 


Dynamics 


X(Z[l/p]) C X(M) 


G(Qp) r> Y 


X e X(M) 


y.&y 


X — r| < e 


0,CY 


1 \ X — T \ ^ € 

< 1 / \ ^~ «' has a Z[l/pl-solution 
[ den(r) < p" ^ ' ^^ 


5-iy,n 0,7^0 



Table 1. Generalised Dani correspondence 



7.3. Spectral estimates. We prove the shrinking target property by analyzing 
the behavior of suitable averaging operators on the space Y. Let /3„ denote the 



uniform probability measure supported on the set i?„ C 
sequence of averaging operators 

nrWn) : L\Y) ^ L\Y) 

defined by 



Then we have the 



7i"y(/3„)0(2/) 







(I)i9-'y)d^^{g), <peL\Y). 



Ideally, one hopes that as in the classical von Neumann ergodic theorem, the av- 
erages 7ry(/5„)0 converge to the space average /^^ as n — )■ oo, but this is not 



always true due to presence of nontrivial unitary characters of G(Qp) in the space 
L'^iY). We denote by Lqq{Y) the subspace of L'^{Y) orthogonal to all characters 
of G(M) X G(Qp). Then one can show that T^Y{Pn)'P — !■ in L^-norm for every 
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G Lqq{Y). The rate of convergence here will play a crucial role. This rate is 
deduce from a variant the spectral gap property, which was discussed in Section 
4 and Section [6} and is determined by a suitable integrability exponent. We fix 
a good maximal compact subgroup Up of G(Qp). For functions 4>,ip E L^(Y), we 
denote by c^^tp{g) = Jy <P{gy)4'{y) dy the corresponding matrix coefficient. Similarly 



to (3.8) we define the spherical integrability exponent 

qpiG) = inf {g > : c^,^ G L«(G(Qp)) for all f/^-inv. 0,V^ G Ll,{Y)} . 

It follows from the result of L. Clozel jC03] that the integrability exponent qp{G) is 
finite, and we establish the following quantitative mean ergodic theorem: for every 
sequence 5„ of C/p-bi-invariant compact subsets of G(Qp), every G LIq(Y) and 
S>0, 

wmm\2 < const(5) ■ ii?nr^+'ii</)|i2. (7.6) 

This mean ergodic theorem is vital for establishing the required shrinking target 
property. We note that in order to derive optimal (or close to optimal) exponents 



for Diophantine approximation in (7.1) it is crucial that the error term is con- 
trolled by the L^-norm rather than a Sobolev norm, which compromises the rate. 



The actual value of the exponent of |-B„| in (7.6) is the crucial parameter which 
ultimately controls the quality of Diophantine approximation, as elaborated further 
below. Thus this approach provides a connection between the generalised Ramanu- 
jan conjectures and the problem of Diophantine approximation on homogeneous 
varieties. 

Now coming back to the original Diophantine approximation problem, we ap- 
ply estimate (7.6) to the averages supported on the sets i?„ as in ( 7.4[ ) (or, more 



precisely, on the [/p-bi-invariant sets UpBnUp). We also estimate the behaviour of 
these averages on the orthogonal complement of L'^^iY) in LF'iY). Since the orthog- 
onal complement is finite-dimensional, this does not present a particular challenge. 
Assuming that 

lOJ > const ■ |5„r^+^ 



with fixed 5 > 0, we use (7.6) to produce an upper estimate on the measure of the 
seiy eY for which B'^yHOg = 0. Then using a Borel-Cantelli-type argument, we 
deduce a shrinking target property that holds for almost every y eY. This implies 
a Diophantine approximation result via the generalised Dani correspondence (Table 
1). If we impose a stronger assumption that 

|0e| > const ■ |fi„r^+^ 

with some fixed 6 > 0, then the above argument can be modified to show that 
B~^y n Og = for all y E Y provided that e is sufficiently small. This produces a 
Diophantine approximation result which is valid for all points in X(Z[l/p]). 

This completes the outline of the following result, which is part of a more general 
theorem stated in |GGN12( Theorem 1.6]: 

Theorem 7.1. Let X be an algebraic set defined over Q equipped with a transitive 
action of connected almost simple algebraic group G defined over Q. We assume 
that X{7j[1/p]) is not discrete in X(]Rt|] Then 



''^One can show that under these assumtions X(Z[l/p]) is open in X(]R). 
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for almost every x G X(Z[l/p]) C X(]R), every 6 > 0, and e G (0, eo(x, (5)), 
the system of inequalities 

/ di.n(x) ,\1p(G)/2 

||x-r||<e and D(r) < I e "pf^) M (77) 

has a solution r G X(Z[l/p]). 

/or every x G X(M), ever?/ 5 > Q, and e G (0, eo(5)), i/ie system of inequalities 

dim(X) ^\ gpC^) 

Is — rll < e anc? Dfr) < ( e "p^*^"' 



/ias a solution r G X(Z[l/]9]). 

In the paper |GGN] (joint with A. Ghosh), we also prove analogues of Khnichin's 
and Jarnik's theorems for Diophantine approximation on homogeneous varieties. 



The quality of Diophantine approximation in Theorem 7.1 is controlled by the 
integrability exponent qp{G). When gp(G) = 2, then estimate (7.7) gives the best 
possible result up to arbitrary small 6 > 0. Coming back to the example (7.2), we 
note that in this case G is the orthogonal group and qp{G) = 2, which follows from 
the work of P. Deligne |D71j on the classical Ramanujan conjecture (see [C] for a 
comprehensive treatment) combined with the Jacquet-Langlands correspondence 
|JL70] . This topic is surveyed in [Rj. 

In conclusion we note the intriguing fact that conversely, Diophantine approxi- 
mation considerations can provide insights in the direction of the generalised Ra- 
manujan conjectures. In particular, one can deduce estimates on the integrability 
exponents. For instance, analysing the quality of Diophantine approximation in M*^ 
by the rational points in 7j[l/pY, one can deduce that 

qp{SU)>2{d-l). 

We refer to |GGN12l Corollary 1.8] for a more general result in this direction. 

8. ERGODIC THEOREMS FOR LATTICE SUBGROUPS 

8.1. Ergodic theorems and equidistribution. We turn to discuss ergodic the- 
orems for countable groups such as F = SL„(Z) and more generally for arithmetic 
subgroups in semisimple algebraic groups. When G is a connected semisimple 
group, we have approached this problem via spectral methods, and detailed in- 
formation about the unitary representation theory of G was essential. However, 
for discrete lattice subgroups this approach is not feasible, since the irreducible 
unitary representations of a lattice cannot be classified (in a Borel manner), and 
a general unitary representation of a lattice cannot be represented uniquely as a 
direct integral of irreducible ones. Thus non-amenable discrete groups appear as 
the hardest case in ergodic theory because they possess neither an asymptotically 
invariant family of sets, nor a usable spectral theory, and so other approaches must 
be developed. 

Although our approach can be applied to lattices in general groups, for the sake of 
keeping our exposition simple we will present it for lattices F in connected simple 
Lie groups G. For more general formulations, we refer to |GN10j . We take a 
family of increasing compact domains Bt in G, set F^ = F fl -B^, and denote by 
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Xt the uniform probability measures supported on the sets Tf. Given an arbitrary 
measure-preserving ergodic action of F on a (standard) probabihty space {X,fj,), 
we consider the corresponding averaging operators 



nx{Xt)f{x) 



h E /(7-^^ 



/ e L^{X). 



i&t 



In order for the sets Fj to behave in a regular manner, we assume that the domains 



Bt are admissible in the sense of Definition 4.2 This mild regularity assumption 
turns out to be sufficient to establish general quantitative ergodic theorems: 

Theorem 8.1. Assume that the action ofT on {X,fi) has a spectral gap. Then the 
following holds. 

• Mean ergodic theorem : For every f G Lp{X) with 1 < p < oo, 



vrx(Ai)/ 



fdfi 



X 



< Cp\Tt\ 



with uniform Cp,9p > 0, independent of f . 
Pointwise ergodic theorem: For every f G 
almost every x, 



L^{X) with p > 1 and for ^- 



7rx(At)/- fdfi 
Jx 



<Cp{f,x)\Tt 



with uniform 6'^ > 0, independent of f and x. 

We emphasize that this result holds for all F-actions which have a spectral gap, 
and furthermore, if F has property T of Kazhdan, the parameters Cp, Op and 9' are 
independent of the action. The only connection to the original embedding of F in 
the group G is in the definition of the sets F^. We remark that an L^'-norm bound 
for Cp{f, ■) (for a suitable s depending on p) holds. Finally, when the action of F 
does not have a spectral gap, we also establish mean and pointwise ergodic theorem 
(but without a rate of convergence, of course). We refer to [GNlQj for details. 



Theorem |8.1| demonstrates a remarkable contrast with the ergodic theory of 
amenable groups where no rate of convergence can be established, as discussed 



in Section 3. Let us give an example to illustrate Theorem 8.1 



Consider the action F 



SL„(Z) on the torus ^ 



r/Z" and set 



Fi = {7GF: loghll < t}. 
Then for every / G L^(T") with p > 1 and almost every x G T", 



1 



TGFt 



X] 



fdfi 



T" 



<Cpif,x)\Tt\ 



with uniform 6' > 0. 



The orbit structure in the above example is quite complicated. The torus contains 
the dense set of rational points which consists of finite F-orbits, but the orbit 
of every irrational point is dense. In particular, the size of the estimator Cp{f,x) 
depends very sensitively on Diophantine properties of x, and it would be interesting 
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to make this dependence more explicit. In the setting of random walks on the torus, 
a similar problem was recently investigated in |BFLMlT] . 

In the case of isometric actions of lattices, we derive the following quantitative 
equidistribution result, which again has no classical amenable analogue: 

Corollary 8.2. Let T act on isometrically on a compact Riemannian manifold X 
equipped with a smooth measure /i of full support. If the action has spectral gap, 
than for every Holder function f G C"'{X), and for every point x in the manifold 

n^ E /(7''^) = [ fdf^ + (|r,r^1|/||cO 

with uniform 6a > 0. 

8.2. Method of the proof of the ergodic theorems. Our approach to proving 
the ergodic theorems for the averages vrx(Af) is based on the idea of inducing the 
action of F to obtain an action to G on a larger space Y = {G x X)/r. The latter 
space has the structure of a fiber bundle over the space G/T with fibers isomorphic 
to X. We then reduce the ergodic theorems for the averaging operators vrx(Ai), 
supported onTnBt and acting on L^(X), to the ergodic theorems for the averaging 
operators 7iY{Pt), supported on the domains Bt and acting on Lp{Y). This involves 
developing a series of approximation arguments which constitute generalisations of 
the argument used in the solution of the lattice point counting problem in Section 
|5| The essence of the matter is to approximate 

by 






T^Yi(3t±ce)fe{y) = T7^ T / feig ^V) ' 



\Bt±C€\ JBt±c, 

with suitably chosen /^ G Lp{Y). The link between the two expression is given by 
setting y = {h,x)T and 

7er 

where Xe is the normalized characteristic function of an identity neighborhood O^. 
The ergodic averages T^Y{.Pt±ce)fe can then be rewritten in full as 

1 



E 



Xe{g ^h^)dg] 0(7 ^x 



I R 

^gP vl-Dtibcel J Bt±c 

We would like the expression in parentheses to be equal to 1 when 7 G Ffl-Bi^ce and 
equal to when 7 ^ F fl Bt+ce, in order to be able to compare it to 7ix{\t)<P- These 
favourable lower and upper estimates depend only on the regularity properties of 
the sets Bt, namely, on the admissibility property. Indeed, the lower bound arises 
since if Xe{g~^h'j) 7^ and g G Bt-ce, then 7 G -B*. The upper bound holds since 
for 7 G F n Bt, the support of function g 1— )■ Xeig'^h'y) is contained in Bt+ce- 

A fundamental point in the completion of the proof of the ergodic theorems is 
played by an invariance principle, namely by the fact that for any given / G L^iY), 
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the pointwise ergodic theorem for the averages vry (/3t) holds for a set of points which 
contain a G-invariant set of full measure. Since the space Y = {G x X)/r is a G- 
equivariant bundle over the G-transitive space G/T, this implies that for every 
point hr, the set of points a; G X in the fiber above it where the pointwise ergodic 
theorem holds is conull in X. This allows us to deduce that the set of points where 
7i"x(At)0(a;) converges is also a set of full measure in X. 

9. Distribution of orbits on algebraic varieties 

9.1. Distribution of orbits in the de-Sitter space. Let us now turn to dis- 
cuss the distribution of dense orbits for actions of arithmetic groups on algebraic 
varieties. These action do not preserve a finite measure (besides the measures sup- 
ported on periodic points), so that this problem falls in the realm of infinite ergodic 
theory. 

Consider an action of a discrete group F on a space X. The distribution of orbits 
of r is encoded by the properties of the averaging operators 

Mix) = ^ V <P{l-'x) (9.1) 

defined for an increasing sequence of finite subsets of F that exhaust F and for a 
family of test functions on X. For example, if F = (7) is a cyclic group, and the 
action preserves a (possibly infinite) measure /i, then as in Section 1 we obtain that 

1 ^ 

i^c^WTl ^ <Pir^)-^'P<Pi^) mL2(X), (9.2) 

~^°° n=-N 

where V denotes the orthogonal projection from L'^{X) to the subspace of F- 
invariant functions in L^(X). However, if the measure is infinite and the action 



is ergodic, then V = 0, and (9.2) contains no information about the distribution 
of orbits. Initially, one might expect that if we pick a correct normalisation fac- 
tor such that ^^ — )■ 0, then the averages -^^A^ ^n=-N^i'y^-'^) exhibit a nontrivial 
limit. But in fact, J. Aaranson [Aa| §2.4] showed that if the action of the cyclic 
group F = (7) is ergodic and conservative, then for any normalisation a{N), either 
for every nonnegative (p ^ -^^(^)) 

1 ^ 
liminf N 0(7"x) = almost everywhere, 

N^oo a{N) ^-^ 

^ ' n=-N 

or there exists a subsequence Nk such that such that for every nonnegative G 
LHX), 0^0, 

lim — } 0(7"x) = 00 almost everywhere. 



-N, 



But while the averages for orbits of Z-actions on infinite measure spaces tend 
fluctuate wildly, it turns out that orbits of "large" groups behave in a more regular 
fashion. We will presently demonstrate this point by computing the distribution 
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of the orbits of the groups of isometries of the (i-dimensional de Sitter space dSd- 
This space can be reahsed as a hypersurface 

2 I I 2 2 1 

Xi-\- ■ ■ ■ -\- x^ — a^rf+i — i 

in W^^^ equipped with the Minkowski metric ds^ = dx\ + ■ ■ • + dx"^^ — <ia;^^_]^. The 
problem of distribution of orbits on the de Sitter space was raised by V. Arnol'd 
[Art 1996-15, 2002-16]. After stating the result, we will explain a general strategy 
how to study distribution of orbits on homogeneous spaces. Our exposition is based 
on |GW07[[GN] . We also refer to p9l lN02l IMJM ILPM lOM [GHil [CM05l iG^Ol 
iNlOl IMWllj for related results. 




Figure 2. The de Sitter space 

In the setting of Arnol'd's question regarding the de Sitter space, we prove the 
following. Let F be a discrete subgroup of Isom((iS'rf) with finite covolume. We 
identify Isom((i5'd) with the orthogonal group SO((i, 1) and denote by || ■ || the 
standard Euclidean norm on the space of {d + l)-dimensional matrices. 

• When d = 2, for every G L}{dS'^) with compact support 

lim - > (p^'y^^x) = I (pdu almost everywhere, (9.3) 

t-S>00 t ^-^ had 

7er:log||7||<t "^"-^ 

where v denotes a (nonzero) invariant measure on dSd (whose normalization 
depends on F). 

• When (i > 3, for every continuous function <^ with compact support and 
every x G dS''' with dense F-orbit, 

l^^t E <Pil-'x)=fjdu., (9.4) 

where the limit measure is diy^{y) = (^^||^n2)(d-2)t2(f+||^||2)(d-2)/2 - 

Moreover, if in addition is subanalytic and Sobolev, then for almost 
every x G dS''', 



-(^ J2 ^il-'^) = I ^ <f>iy) dvM + 0^,.(e-'*) (9.5) 

7er:log||7||<t '''^^'^ 

with 5 = 6{(j),x) > 0. 
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The above equidistribution results have several remarkable features. In the case 
of dimension 2, while the cardinality of the set {7 G F : log||7|| < t} grows 
exponentially as const ■ e*, it turns out that only for a polynomial number of points 



7 does the orbit points •yx in (9.3) come back to a compact set. Nonetheless, the 
small fraction of points returning by time t becomes equidistributed in the compact 
set, with respect to the invariant measure as t — )■ cxd. 

When d > 3, while the cardinality of the set {7 G F : log||7|| < t} grows as 
const ■ e^'^"^)*, only an exponentially small fraction of them satisfy that •yx returns 
to a compact set. The set of returning points does become equidistributied in the 
compact set, but this time the limiting measures z/^ are not invariant under F, and 
furthermore depend nontrivially on x. The measures u^, x G X, on the pseudo- 
Riemannian space X should be considered as analogues of the Patterson-Sullivan 
measures in this context. 

9.2. The duality principle on homogeneous spaces. The techniques devel- 
oped in |GNj apply in the following general setting. Let G C SL(i(]R) be connected 
and of finite index in an algebraic group, and F a discrete subgroup of G with finite 
covolume. Let X be an algebraic homogeneous space of G equipped with a smooth 
measure on which F acts ergodically. We fix a proper homogeneous polynomial 
p : Mrf(]R) — )■ [0, 00) and consider the sets 

Fi = {7 G F : logp(7) < t}. 

Our goal is to describe the asymptotic distribution of orbits of F, or in other words 
the asymptotic behaviour of the sums J^-yert <^(7~^^) as t — )■ 00 for a sufficiently 
rich collection of functions on X with compact support. In order to find the right 
normalisation for the sum Xl^er 4'{i~'^^)j o^^^ needs to compute what proportion 
of points in the orbit returns to compact subsets Q G X. Given a compact subset 
Q oi X with non-empty interior and x G X, we set 

loglF^-^xn^l 



a = lim sup 

One can check that this quantity is independent of the choices of Q and x. We will 
distinguish two cases according to whether a = 0, i.e., the return rates are at most 
subexponentially, and a > 0, i.e., the return rates are exponential. 

Theorem 9.1 (polynomial return rates). Assume that a = 0. Then there exists 
b G Z>o such that the averaging operator 

■y&t 
satisfy 

• (strong maximal inequality) For every p > 1, a compact D G X , and 

sup \At(f)\ < const(p, D) ■ ||0||lp(15) 
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• (pointwise ergodic theorem) For every G L^{X) with compact support, 
lim ^t0(x) = / (pdu almost everywhere, 

where u is a (nonzero) G-invariant measure on X . 

When a > 0, we prove analogous results for functions lying in a suitable Sobolev 
space L^{X). In this case it is also possible to establish rates of convergences. 

Theorem 9.2 (exponential return rates). Assume that a > 0. Then there exist 
b e Z>o and I G Z>o such that the averaging operator 



Mix) = -^ 5^ 0(7"' ■ x) 



■yert 

satisfies 

• (strong maximal inequality) For every p > 1, a compact D C X, and a 
nonnegative G L^{D), 



t>to 



< const{p,D) ■ miLfiD) 

LP{D) 

(pointwise ergodic theorem) For every p > 1 and a nonnegative bounded 
(f) G L^i{X) with compact support, 

lim >lt0(x) = / (pdux almost everywhere, (9.6) 



where v^, x & X , is a family of absolutely continuous measures on X with 
positive densities. 
• (quantitative pointwise ergodic theorem) For every p > \ and a nonnegative 
continuous subanalytic G L^i{X) with compact support, 

At(t){x) = I 0^1/2: + >^Cj(x, 0)t^* + O^^.' (e~^*) almost everywhere (9.7) 
J^ i=i 

with 6 = S{x,(f>) > 0. 



If in Theorem 9^ we additionally assume that the stabiliser of a point x in G is 
semisimple, then the stated results can be further improved. One can replace the 
Sobolev norms by L^ norms (see jGNl Theorem 1.3]). Moreover, one can show (see 
|GW07j ) for continuous functions with compact support, the asymptotic formula 



(9.6) holds for all x G X whose F-orbit in X is dense. 



9.3. Equidistribution in algebraic number fields. It seems unlikely that in 



the generality of Theorem 9.2, the asymptotic expansion (9.7) can be proved for 
an explicit set of x G X of positive measure, but there is an important case when 
this can be achieved. Assuming that the action of F on X preserves a Riemannian 
metric, one can show (see |GN12bj ) that all F-orbits in X become equidistributed 
with a rate. The observation that all orbits for isometric actions on compact spaces 
are equidistributed goes back to the work of Y. Guivarc'h |G69j . and in |GN12bj 
we have developed a quantitative version of this argument which also applies to 
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infinite-measure setting. For instance, fix a non-square integer d > and let us 
consider the action of F = SL2(Z[vu]) on the upper half plane H^ by fractional 
linear transformations. We set 

F, = {7GF: log(||7f + ||7f)<t} 

where || ■ || is the standard Euclidean norm, and 7 denotes the Galois involution 
of the field Q{\/d). Since F embeds diagonally in G = SL2(M) x SL2(M) as an 



irreducible lattice, and H^ is a homogeneous space of the group G, Theorem |9.2 
applies in this case and gives information about the asymptotic distribution of orbits 
Fx for almost every x G X, and the method of jGN12b| Theorem 1.5] leads to the 
following equidistribution result: there exists 6 > such that given a compact 
subset D C H^, for every x E D and every G C^(EI^) with support contained in 



-V0(x7)= / (P{z)du{z) + OD{e- 



76rt 
where i/ is a (nonzero) G- invariant measure on H^. 

9.4. Ideas of the proof. In conclusion we indicate some ideas that lie behind the 



proof of Theorem 9.1 and 9.2 The argument can be divided into two main steps: 



(I) compare the asymptotic behaviour of discrete averages 



^ 0(7"'a;) (9.8) 



7er:logp(7)<t 



with the asymptotic behavior of continuous averages 

(t){g-'x)dmG{g), (9.9) 



/ 



' geG:logp{g)<t 

where mc is an invariant measure on G. 



(II) establish the asymptotics for the continuous averages (9.9). 

To achieve step (I), we think of G as a fiber bundle over X, and show that the 
fibers of this bundle become equidistributed on the space F\G. More explicitly, we 
identify X with the homogeneous space G/H where if is a subgroup of G. For 

u,v E G, we set 

Ht[u,v] = {h E H : \ogp{uhv) < t}. 

Then we show that the sum (9.8 ) can be approximated by a finite linear combination 
of integrals 

/ fiiVih) dmH{h) 

J Ht^[ui,Vi] 

with suitably chosen tj ^ t, Ui,Vi E G, fi : T\G — t- M, and i/i G F\G, where itih is 
a left invariant measure on H. The asymptotic behaviour of these integrals can be 
analysed using either representation-theoretic techniques as in jGN] or the theory 
of unipotent fiows as in |GW07] . In both cases we conclude that as t — t- cxd, 

vrri Y\ / fiyh)dmH{h)^ / f dmr\G, 



ERGODIC THEOREMS AND NUMBER THEORY 35 

where mr^G denotes the normahsed invariant measure on r\G. Finally, this allows 
to conclude that the above linear combination is approximately equal to the integral 



(9.9), which completes step (I). 

In step (II), we establish an asymptotic formula for integral 

v{t) = / (t){g'^x)dm{g) 

JgeG:logp(g)<t 

where (f) is a continuous subanalytic function with compact support. For this, we 
consider its transform 



f{s)= / t-%{\ogt)dt 
Jo 
which converges when Re(s) is sufficiently large. Using a suitable version of resolu- 
tion of singularities as in |P94j we show that f{s) has a meromorphic continuation 
beyond the first pole. Then the asymptotic formula for v{t) follows via a Tauberian 
argument. 
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